Efficient codebooks for fast and accurate low resource ASR systems
نویسندگان
چکیده
Today, speech interfaces have become widely employed in mobile devices, thus recognition speed and resource consumption are becoming new metrics of Automatic Speech Recognition (ASR) performance. For ASR systems using continuous Hidden Markov Models (HMMs), the computation of the state likelihood is one of the most time consuming parts. In this paper, we propose novel multi-level Gaussian selection techniques to reduce the cost of state likelihood computation. These methods are based on original and efficient codebooks. The proposed algorithms are evaluated within the framework of a large vocabulary continuous speech recognition task.
منابع مشابه
Efficient codebook for fast and accurate low resource ASR systems
Nowadays, speech interfaces have become widely employed in mobile devices, thus recognition speed and power consumption are becoming new metrics of Automatic Speech Recognition (ASR) performance. For ASR systems using continuous Hidden Markov Models (HMMs), the computation of the state likelihood is one of the most time consuming parts. Hence, we propose in this paper novel multi-level Gaussian...
متن کاملFast and Accurate OOV Decoder on High-Level Features
This work proposes a novel approach to out-of-vocabulary (OOV) keyword search (KWS) task. The proposed approach is based on using high-level features from an automatic speech recognition (ASR) system, so called phoneme posterior based (PPB) features, for decoding. These features are obtained by calculating time-dependent phoneme posterior probabilities from word lattices, followed by their smoo...
متن کاملTowards large vocabulary ASR on embedded platforms
In this paper we present an overview of an automatic speech recognition system implementation in the context of embedded systems. Specific challenges presented by low resource platforms will be addressed for the basic components of an ASR decoder. Our main objective is to utilize and modify the technology developed for large vocabulary ASR to achieve efficient LVCSR on embedded systems as well.
متن کاملAlgorithms for data-driven ASR parameter quantization
There is fast growing research on designing energy-efficient computational devices and applications running on them. As one of the most compelling applications for mobile devices, automatic speech recognition (ASR) requires new methods to allow it to use fewer computational and memory resources while still achieving a high level of accuracy. One way to achieve this is through parameter quantiza...
متن کاملDevelopment and Analysis of a Novel Multi-Mode MPPT Technique with Fast and Efficient Performance for PMSG-Based Wind Energy Conversion Systems
Wind energy is one of the most promising renewable energy resources. Due to instantaneous variations of the wind speed, an appropriate Maximum Power Point Tracking (MPPT) method is necessary for maximizing the captured energy from the wind at different speeds. The most commonly used MPPT algorithms are Tip Speed Ratio (TSR), Power Signal Feedback (PSF), Optimal Torque Control (OTC) and Hill Cli...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Speech Communication
دوره 51 شماره
صفحات -
تاریخ انتشار 2009